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Abstract. In functional programming languages the use of infinite struc- 
tures is common practice. For total correctness of programs dealing with 
infinite structures one must guarantee that every finite part of the result 
can be evaluated in finitely many steps. This is known as productivity. 
For programming with infinite structures, productivity is what termina- 
tion in well-defined results is for programming with finite structures. 
Fractran is a simple Turing-complete programming language invented by 
Conway. We prove that the question whether a Fractran program halts 
on all positive integers is Il^-complete. In functional programming, pro- 
ductivity typically is a property of individual terms with respect to the 
inbuilt evaluation strategy. By encoding Fractran programs as specifi- 
cations of infinite lists, we establish that this notion of productivity is 
n2-complete even for some of the most simple specifications. Therefore 
it is harder than termination of individual terms. In addition, we explore 
generalisations of the notion of productivity, and prove that their com- 
putational complexity is in the analytical hierarchy, thus exceeding the 
expressive power of first-order logic. 



1 Introduction 

For programming with infinite structures, productivity is what termination is for 
programming with finite structures. In lazy functional programming languages 
like Haskell, Miranda or Clean the use of data structures, whose intended seman- 
tics is an infinite structure, is common practice. Programs dealing with such in- 
finite structures can very well be terminating. For example, consider the Haskell 
program implementing a version of Eratosthenes' sieve: 

prime n = primes ! ! (n-1) 
primes = sieve [2..] 

sieve (n:xs) = n: (sieve (filter (\m -> m 'mod' n /= 0) xs)) 

where prime n returns the n-th prime number for every n > 1. The func- 
tion prime is terminating, despite the fact that it contains a call to the non- 
terminating function primes which, in the limit, rewrites to the infinite list of 
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prime numbers in ascending order. To make this possible, the strategy with re- 
spect to which the terms are evaluated is crucial. Obviously, we cannot fully 
evaluate primes before extracting the n-th element. For this reason, lazy func- 
tional languages typically use a form of outermost-needed rewriting where only 
needed, finite parts of the infinite structure are evaluated, see for example [12]. 

Productivity captures the intuitive notion of unlimited progress, of 'work- 
ing' programs producing values indefinitely, programs immune to livelock and 
deadlock, like primes above. A recursive specification is called productive if not 
only can the specification be evaluated continually to build up an infinite normal 
form, but this infinite expression is also meaningful in the sense that it repre- 
sents an infinite object from the intended domain. The study of productivity (of 
stream specifications in particular) was pioneered by Sijtsma [14]. More recently, 
a decision algorithm for productivity of stream specifications from an expressive 
syntactic format has been developed [5] and implemented [4]. 

We consider various variants of the notion of productivity and pinpoint their 
computational complexity in the arithmetical and analytical hierarchy. In func- 
tional programming, expressions are evaluated according to an inbuilt evaluation 
strategy. This gives rise to productivity with respect to an evaluation strategy. We 
show that this property is n§-complete (for individual terms) using a standard 
encoding of Turing machines into term rewriting systems. Next, we explore two 
generalisations of this concept: strong and weak productivity. Strong productivity 
requires every outermost-fair rewrite sequence to 'end in' a constructor normal 
form, whereas weak productivity demands only the existence of a rewrite se- 
quence to a constructor normal form. As it turns out, these properties are of 
analytical complexity: II] and Sj-complete, respectively. 

Finally, we encode Fractran programs into stream specifications. In contrast 
to the encoding of Turing machines, the resulting specifications are of a very 
simple form and do not involve any computation on the elements of the stream. 
We show that the uniform halting problem of Fractran programs is Ilj-complete. 
(Although Turing-completeness of Fractran is folklore, the exact complexity has 
not yet been investigated before.) Consequently we obtain a strengthening of the 
earlier mentioned n2-completeness result for productivity. 

Fractran [2] is a remarkably simple Turing-complete programming language 
invented by the mathematician John Horton Conway. A Fractran program is a 
finite list of fractions ^ , . . . , ^ . Starting with a positive integer tiq , the algorithm 
successively calculates Ui^i by multiplying with the first fraction that yields 
an integer again. The algorithm halts if there is no such fraction. 

To illustrate the algorithm we consider an example of Conway from [2]: 

rrrei^2329779577J_ni^l^l555 
91' 85' 38' 33' M' 23' 19' 17' 13' 11 ' 14' Y' T 

We start with uq ~ 2. The leftmost fraction which yields an integer product 
is ^, and so rii = 2 • ^ = 15. Then we get n2 = 15 • ^ = 825, etcetera. By 
successive application of the algorithm, we obtain the following infinite sequence; 



2, 15, 825, 725, 1925, 2275, 425, 390, 330, 290, 770, . 
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Apart from 2^, the powers of 2 occurring in this infinite sequence are 2^, 2^^, 2^, 2^, 
2ii^ 2^^, 2^^, 2^^, . . ., where the exponents form the sequence of primes. 

We translate Fractran programs to stream specifications in such a way that 
the specification is productive if and only if the program halts on all no > 1. 
Let us define the target format of this translation: the lazy stream format (LSF). 
LSF consists of stream specifications of the form M — > C[M] where C is a context 
built solely from: one data element the stream constructor ':', the functions 
head(a:: : cr) — s- x and tail(a; : cr) ^ ci, unary stream functions mod„, and fc-ary 
stream functions with the following defining rules, for every n, k > 1: 

mod„(cr) head(cr) : mod„(tail"(cr)) (LSF) 
ziPfc(o'i:'^2 ■ • ■ ,crfc) head((Ti) : z\p^{a2, ■ ■ . , cTfe, tail(CTi)) 

By reducing the uniform halting problem of Fractran programs to productivity 
of LSF, we get that productivity for LSF is E^-complete. 

This undecidability result stands in sharp contrast to the decidability of 
productivity for the pure stream format (PSF, [5]). Let us elaborate on the 
difference between these two formats. Examples of specifications in PSF are: 

J ^ : 1 : even(J) and Z ^ : zip(even(Z), odd(Z)) , 

including the defining rules for the stream functions involved: 

even(a; : a) ^ x : odd(cr) , odd(a; : a) even(<T) , zip(.T : cr, r) x : zip(r, cr) , 

where zip 'zips' two streams alternatingly into one, and even (odd) returns a 
stream consisting of the elements at its even (odd) positions. The specification 
for Z produces the stream 0:0:0:... of zeros, whereas the infinite normal form 
of J is : 1 : : : even", which is not a constructor normal form. 

Excluded from PSF is the observation function on streams head(x : a) ^ x. 
This is for a good reason, as we shall see shortly. PSF is essentially layered: 
data terms (terms of sort data) cannot be built using stream terms (terms of 
sort stream). As soon as stream dependent data functions are admitted, the 
complexity of the productivity problem of such an extended format is increased. 
Indeed, as our Fractran translation shows, productivity of even the most simple 
stream specifications is undecidable and H^-hard. The problem with stream de- 
pendent data functions is that they possibly create 'look-ahead': the evaluation 
of the 'current' stream element may depend on the evaluation of 'future' stream 
elements. To see this, consider an example from [14]: 

S„ ^ : S„(7i) : S„ 

where for a term t of sort stream and n £ N, we write t{n) as a shorthand for 
head(tail"(t)). If we take n to be an even number, then S„ is productive, whereas 
it is unproductive for odd n. 

A hint for the fact that it is Il^-hard to decide whether a lazy specification is 
productive already comes from a simple encoding of the CoUatz conjecture (also 
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known as the 'Sx+l-problcm' [11]) into a productivity problem. Without proof 
we state: the Collatz conjecture is true if and if only the following specification 
produces the infinite chain of data elements •: 



In order to understand the operational difference between rules in PSF and rules 
in LSF. consider the following two rules: 



The functions defined by these rules are extensionally equivalent: they both im- 
plement the identity function on fully developed streams. However, intensionally, 
or operationally, there is a difference. A term read'(s) is a redex only in case s 
is of the form u : t, whereas read(s) constitutes a redex for every stream term s, 
and so head(s) can be undefined. The 'lazy' rule (2) postpones pattern matching. 
Although in PSF we can define functions mod^ and zipj. extensionally equiva- 
lent to mod„ and zip^,, a pure version collatz' of collatz in (1) above (using modg 
and zip2 instead) can easily be seen to be not productive (it produces two data 
elements only), and to have no bearing on the Collatz conjecture. 

Contribution and Overview. In Section 2 we show that the uniform halting prob- 
lem of Fractran programs is fl^-complete. This is the problem of determining 
whether a program terminates on all positive integers. Turing-completeness of 
a computational model does not imply that the uniform halting problem in the 
strong sense of termination on all configurations is Ilj-complete. For example, 
assume that we extend Turing machines with a special non-terminating state. 
Then the computational model obtained can still compute every recursive func- 
tion. However, the uniform halting problem becomes trivial. 

Our result is a strengthening of the result in [10] where it has been shown 
that the generalised Collatz problem (GCP) is H^-complete. This is because 
every Fractran program P can easily be translated into a Collatz function / 
such that the uniform halting problem for P is equivalent to the GCP for /. The 
other direction is not immediate, since Fractran programs form a strict subset 
of Collatz functions. We discuss this in more detail in Section 2. 

In Section 3 we explore alternative definitions of productivity and make them 
precise in the framework of term rewriting. These can be highly undecidable: 
'strong productivity' turns out to be Hj-complete and 'weak productivity' is 
E}-complete. Productivity of individual terms with respect to a computable 
strategy, which is the notion used in functional programming, is Hj-complete. 

In Section 4 we prove that productivity H2-complete even for specifications 
of the restricted LSF format. The new proof uses a simple encoding of Fractran 
programs P into stream specifications of the form Mp ^ C[Mp], in such a way 
that Mp is productive if and only if the program P halts on all inputs. The 
resulting stream specifications are very simple compared to the ones resulting 



collatz • : zip2(collatz, mod6(tail (collatz))) 



(1) 



read(tT) head(o-) : read(tail(cr)) 
read'(a: : a) x : read'(o') 



(2) 
(3) 



Complexity of Fractran and Productivity 



5 



from encoding of Turing machines employed in Section 3. Whereas the Turing 
machine encoding essentially uses calculations on the elements of the list, the 
specifieations obtained from the Fractran encoding contain no operations on the 
list elements. In particular, the domain of data elements is a singleton. 

Related Work. In [3] undecidability of different properties of first-order TRSs is 
analysed. While the standard properties of TRSs turn out to be either Ej- or 
n^-complete, the complexity of the dependency pair problems [1] is essentially 
analytical: it is shown to be IlJ-complete. We employ the latter result as a basis 
for our n}- and Sj-completeness results for productivity. 

Ro§u [13] shows that equality of stream specifications is Ilj-completc. We 
remark that this result can be obtained as a corollary of our translation of 
Fractran programs P to stream specifications Mp. Stream specifications Mp have 
the stream •:•:... as unique solutions if and only if they are productive. Thus 
n§-completeness of productivity of these specifications implies Il^-completeness 
of the stream equality problem M p = •:•:... . 

One of the reviewers pointed us to recent work [6] of Grue Simonsen (not 
available at the time of writing) where Ilj-completeness of productivity of or- 
thogonal stream specifications is shown. Theorem 3.5 below can be seen as a 
sharpening of that result in that we consider general TRSs and productivity 
with respect to arbitrary evaluation strategies. For orthogonal systems the eval- 
uation strategy is irrelevant as long as it is outermost-fair. Moreover we further 
strengthen the result on orthogonal stream specifications by restricting the for- 
mat to LSF. 

2 Fractran 

The one step computation of a Fractran program is a partial function. 

Definition 2.1. Let P = . . . , ^ be a Fractran program. The partial func- 
tion /p : N ^ N is defined for all n e N by: 

f n • 2i where is the first fraction of P such that n • — 6 N, 

I undefined if no such fraction exists. 

We say that P halts on n G N if there exists i G N such that fp (n) = undefined. 
For n, m G N we write n -^p m whenever m = fp{n). 

The Fractran program for generating prime numbers, that we discussed in the 
introduction, is non-terminating for all starting values no, because the product 
of any integer with ^ is an integer again. However, in general, termination of 
Fractran programs is undecidable. 

Theorem 2.2. The uniform halting problem for Fractran programs, that is, de- 
ciding whether a program halts for every starting value no G N>o, is Il2-complete. 
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A related result is obtained in [10] where it is shown that the generalised CoUatz 
problem (GCP) is H^-complete, that is, the problem of deciding for a CoUatz 
function / whether for every integer a; > there exists i S N such that f''{x) = 1. 
A CoUatz function / is a function / : N — > N of the form: 



for some p G N and rational numbers Ui, hi such that /(n) S N for all n G N. 

The result of [10] is an immediate corollary of Theorem 2.2. Every Fractran 
program P is a CoUatz function f'p where f'p is obtained from fp (see Defi- 
nition 2.1) by replacing undefined with 1. We obtain the above representation 
of CoUatz functions simply by choosing for p the least common multiple of the 
denominators of the fractions of P. We call a Fractran program P trivially im- 
mortal if P contains a fraction with denominator 1 (an integer). Then for all not 
trivially immortal P, P halts on all inputs if and only for all a; > there exists 
i S N such that fp{x) = 1. Using our result, this implies that GCP is E^-hard. 

Theorem 2.2 is a strengthening of the result in [10] since Fractran programs 
are a strict subset of CoUatz functions. If Fractran programs are represented as 
CoUatz functions directly, for all < i < p it holds either hi = 0, or = and 
hi = 1. Via such a translation Fractran programs are, e.g., not able to implement 
the famous CoUatz function C(2n) = n and C{2n + 1) = 6n + 4 (for all n G N), 
nor an easy function like f{2n) = 2n + 1 and f{2n + 1) = 2n (for all n G N). 

For the proof of Theorem 2.2 we devise a translation from Turing machines to 
Fractran programs ([10] uses register machines) such that the resulting Fractran 
program halts on all positive integers (no > 1) if and only if the Turing machine is 
terminating on all configurations. That is, we reduce the uniform halting problem 
of Turing machines to the uniform halting problem of Fractran programs. 

We briefly explain why we employ the uniform halting problem instead of the 
problem of totality (termination on all inputs) of Turing machines, also known 
as the initialised uniform halting problem. When translating a Turing machine 
M to a Fractran program Pa/, start configurations (initialised configurations) 
are mapped to a subset Im C N of Fractran inputs. Then from Hj-hardness of 
the totality problem one can conclude H^-hardness of the question whether Pm 
terminates on all numbers from Im- But this does not imply that the uniform 
halting problem for Fractran programs is Ilj-hard (termination on all natural 
numbers n G N) . The numbers not in the target of the translation could make the 
problem both harder as well as easier. A situation where extending the domain 
of inputs makes the problem easier is: local confluence of TRSs is n2-complete 
for the set of ground terms, but only S^-complete for the set of all terms [3]. 

To keep the translation as simple we restrict to unary Turing machines having 
only two symbols {0, 1} in their tape alphabet. being the blank symbol. 

Definition 2.3. A unary Turing machine M is a triple {Q, qo, S), where Q is a, 
finite set of states, qo & Q the initial state, and S : Qx{0,l} ^ Qx{0, l}x{L, R} 




if n = (mod p) 



if n — p — I (mod p) 
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a (partial) transition function. A configuration of M is a pair (g, tape) consisting 
of a state q £ Q and the tape content tape : Z {0, 1} such that the support 
{n e Z I tape{n) ^ 0} is finite. The set of aU configurations is denoted by 
Confj^j. We define the relation — >m on the set of configurations Confj^^ as follows: 
(g, tape) -^m (<?', tape') whenever: 

— 5{q, tape{0)) = {q' , f, L), tape'{l) = / and Vn ^ 0. tape' [n + 1) = tape{n), or 

— d{q, tape{0)) = {q' , /, R), tape'{—l) ~ f and Vn 7^ 0. tape' [n — 1) = tape{n). 

We say that A/ halts (or terminates) on a configuration (q, tape) if the configu- 
ration {q, tape) does not admit infinite —^M rewrite sequences. 

The uniform halting problem of Turing machines is the problem of deciding 
whether a given Turing machine Af halts on all (initial or intermediate) config- 
urations. The following theorem is a result of [7]: 

Theorem 2.4. The uniform halting problem for Turing machines is Il2-complete. 

This result carries over to unary Turing machines using a simulation based on a 
straightforward encoding of tape symbols as blocks of zeros and ones (of equal 
length), which are admissible configurations of unary Turing machines. 

We now give a translation of Turing machines to Fractran programs. Without 
loss of generality we restrict in the sequel to Turing machines M = {Q,qo,S) for 
which 6{q, x) = (g', s' , d!) implies q ^ q' . In case M does not fulfil this condition 
then we can find an equivalent Turing machine M' — {Q \J Q^,qo,S') where 
Q# = {q^ \ q £ Q} and 6' is defined by 5'{q,x) — (j)^,s,d) and S'{q^,x) = 
(p, s, d) for S{q, x) ~ {p, s, d). 

Definition 2.5. Let AI = {Q,qo,S) be a Turing machine. Let tapcg, h, tape^., 
tape'g, h' , tape'^, itil^x, 'mFi.,x, copy^ and Pq for every q € Q and x € {0, 1} be 
pairwise distinct prime numbers. The intuition behind these primes is: 

— tapCf and tape^. represent the tape left and right of the head, respectively, 

— h is the tape symbol in the cell currently scanned by the tape head, 

— tape'g, h' , tape'^ store temporary tape content (when moving the head), 

^ f^L,x, "m-R^x execute a left or right move of the head on the tape, respectively, 

— copUx copies the temporary tape content back to the primary tape, and 

— Pq represent the states of the Turing machine. 

The subscript x G {0, 1} is used to have two primes for every action: in case an 
action p takes more than one calculation step we cannot write since then p 
in numerator and denominator would cancel itself out. We defme the Fractran 
program Pm to consist of the following fractions (listed in program order) : 



to get rid of illegal configurations, 

TOL,i-a ■ tape'g mL,i-x ■ tape'^ mL,i~x ■ tape'^ mL,i-x ■ h copy^ 



p-p' 



1 



for every p,p' £ {ruLfi, rnL,i, m.Rfi, mR,i, copy^, copy^} 
every p,p' & {pq\q<£Q} and p,p' G {/i, h'} 



(4) 



mL,x ■ tapej 



fTiL^x ■ tapCj. 



mL,x 



■ h' 



mL,x • tape I tul 



>x 



8 Jorg EndruUis, Clemens Grabmayer, and Dimitri Hendriks 

with X G {0, 1}, for moving the head left on the tape, 

rriRA-x ■ tape'^ tura-^x ■ tapef mR,i~x ■ tape'^ mR^i^x ■ h copy^ 
rriR^x ■ tapef mR^x ■ tape^ mR^x ■ h' mR^x ■ tape^ mR^x 

with X e {0, 1}, for moving the head right on the tape, 

copyi_^ ■ tape^ copy^_^ ■ tape^ 1 

copy^ ■ tape'^ copy^ ■ tape'j. copy^ 

with X G {0, 1}, for copying the temporary tape back to the primary tape, 
Pq' ■ h'"' ■ nidfi 



(6) 



(7) 



Pq ■ h 
1 

Pq ■ h 

Pq: ■ h'"' ■ rudfi 

Pq 



whenever 5{q, 1) — (g', s' , d) (8) 
(for termination) for every q £ Q (9) 

whenever (5(g, 0) = (g', s', d) (10) 



for the transitions of the Turing machine. Whenever we use variables in the 
rules, e.g. x £ {0, 1}, then it is to be understood that instances of the same rule 
are immediate successors in the sequence of fractions (the order of the instances 
among each other is not crucial) . 

Example 2.6. Let M = {Q,ao,5) be a Turing machine where Q = {ao,ai,5}, 
and the transition function is defined by (5(ao, 0) = (6, 1, R), (5(ai, 0) = (6, 1, i?), 
(5(ao, 1) = (ai, 0, i?), (5(ai, 1) = (oo, 0, R), 5{h, 1) = (ag, 0, i?), and we leave (5(6, 0) 
undefined. That is, M moves to the right, converting zeros into ones and vice 
versa, until it finds two consecutive zeros and terminates. Assume that M is 
started on the configuration 161001, that is, the tape content 11001 in state 
h with the head located on the second 1. In the Fractran program Pm this 
corresponds to hq = ph ■ tape] ■ ■ iape^*^" as the start value where we represent 
the exponents in binary notation for better readability. Started on uq we obtain 
the following calculation in Pm- 

Ph ■ tape] ■ ■ tapel'^'^ (configuration 161001) 
^(8) rriRfl -paa ■ tape] ■ tapel^° ^(6;P') ™-R.,o ■ Pa,, ' tape] ■ tape'^° 
^(6;2"d) rnR,i -pao ' tape'l" ■ tape'^" -^(6;5'h) copy„ ■ pag ■ tape'l^ ■ tape'^^ 
~*(7;i=')^(7;2"d)^(7;3'-<i) Pao ' tape]° ■ tape]." (configuration lOaOOl) 

10 111 , „10 , ™ „ , „10 L/l 



^{10) ^^R.o ■ Pb ■ tape ( -h ■ tape^ -^(^g-i^t^ niR^i ■ pi, ■ tapCf -h ■ tape^ 
2nd) niR^i ■ pb ■ tape'l°° ■ h'^ ■ tape'^ -^(a-y^+b^^) copyQ ■ Pb ■ tape^^^ ■ tc 
^(7;i=')^(7;2"'i)-^(7;3rd) Pb ■ tape]'^^ ■ tape]. (configuration 101601) 



reaching a configuration where the Fractran program halts. 
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Definition 2.7. Wc translate configurations c ~ {q, tape) of Turing machines 
M = {Q,qo,6) to natural numbers (input values for Fractran programs). We 
reuse the notation of Definition 2.5 and define: 

ric = tapef ■ pq ■ ■ tape^ 

oo oo 

L = ^2'-iape(-l-i) = iape(O) R = ^T ■ tape{l + i) 

i=0 i=0 

Lemma 2.8. For every Turing machine M and configurations Ci, C2 we have: 



(i) if Ci -^M C2 then -^*p^^ n^^, and 
(a) if Ci is a — >A/ normal form then Ji^ 



undefined. 



Proof. Let M — (Q, qa,S), and Ci = {qi, tape^) for i G {1, 2}. Then for i S {1, 2} 
we have = tape^' ■ pq. ■ h^^ ■ tape^' with Li, Hi and Ri as L, H and R in 
Definition 2.7. For (i) assume that ci ~*m c^. By the definition of -^m there are 
two cases: the head moves left or right. We consider 'moving left' ('moving right' 
is analogous). Then (5(gi, ^ape]^(0)) = {q2,s,L), tape2{l) = s, Vn 7^ 0. tape2{n + 
1) = tapei{n). Therefore L2 = [Li/2J. H2 = tapei{~l) = Li mod 2 and 
i?2 = s + 2 • If tape^{0) = 1 then Hi ^ 1 and therefore the first fraction of 
Pm applicable to rtci is {pq2 ■ h"^ ■ mL^) / {pq^ ■ h); otherwise if tapei{0) = then 
Hi = and the fraction is {pq^ ■ h'^ ■ mL,o)/{Pqi)- Both cases result in (numbers 
in the justification refer to fractions of Definition 2.5): 



Pm 


Pq2 


tape^' ■ h" 


tape^' ■ mL,o 


((8) or (10)) 


Pm 


Pq2 




• tape^^'^'^^ ■ h'^ ■ tape^' ■ m^^x 


(1^' of (5)) 


+ 

Pm 


Pq2 


tapef' 


■ tape^'"'^'^^ ■ h'^ ■ tape'^'^' ■ mL,y 


(2"^^ of (5)) 


<1 
Pm 


Pq2 


I Li mod 2 

tape^ 


■ tape^^'^'^^ ■ tape'^'^^'^' ■ mL,y 


(3'-' of (5)) 


<1 
Pm 


Pq2 




. tape'^'+^-'^' ■ ruL,, 


(4'h of (5)) 


<1 
Pm 


Pq2 






(5'h of (5)) 


+ 

Pm 


Pq2 




/i^i-°d2.^^^g.+2.i?, ^^^^ 


((7)) 



For (ii) assume that ci is a —>-m normal form. Then 6{qi, tapei{0)) is undefined. 
If tape i{0) = 1 then Hi = 1. Since there is no matching fraction (8), the first 
applicable fraction is from (9), which removes pq-^ and thus leads to termination. 
If tapei{0) — then Hi ^ thus the only applicable fractions can be among (10) 
however there is no matching fraction since S{qi, tape-^^{0)) is undefined. □ 

Finally, we prove Theorem 2.2 which states that the uniform halting problem 
for Fractran programs is n2-complete. 

Proof (Theorem 2.2). For Il^-hardness use the uniform halting problem of Turing 
machines which is Hj-complete (see Theorem 2.4). Let M be a Turing machine. 
We prove that M halts on all configurations if and only if Pm halts on all 
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integers n > 0. If there is a configuration c on which M does not hah, then the 
Fractran program P/v/ does not hah on Uc by Lemma 2.8 (i). Thus assume that 
M halts on all configurations. Let C = {m^^Oi w^.i, mn ^^ "Tifj^i, copyQ, copiji}. 
Let n > be arbitrary. By Lemma 2.8 it suffices to show that n — >p undefined 
or n — >p He for some configuration c. By the first fractions of Pm we have 
n — >Pj,^ n' such that n' contains at most one prime factor from C, at most one 
Pq and at most one {h, h'} (and none of these primes has an exponent > 1). 

Assume that n' contains or to^. The or m^, fractions cannot be ap- 
plied infinitely often in sequence since they decrease all exponents of tape'^, tape'^ 
and h' to 0, respectively. After rriL or m^j there always follows copy which then 
increases tape^, tape^,, but decreases tape'^, tape'^ to the value and afterwards 
copy 'removes' itself. We call the reached configuration n" . Then n" contains 
only the prime factors tapcg, tape,, with exponent > 0, /i with exponent < 1 and 
at most one of the pq with exponent < 1. If n" does not contain any pq then 
n" -^Pm undefined. Otherwise there exists a configuration c such that n" = ric- 

If n' does not contain m^, or m^, then neither the fractions (4), nor (7) 
(copy), can be applied infinitely often in sequence. Application of (9) removes 
the only remaining pq and thus leads to termination. Thus any non-terminating 
sequence contains an application of (8) or (10), which brings us back to the case 
of n' containing or mn which we have already analysed. □ 

3 What is Productivity? 

A program is productive if it evaluates to a finite or infinite constructor normal 
form. This rather vague description leaves open several choices that can be made 
to obtain a more formal definition. We explore several definitions and determine 
the degree of undecidability for each of them. See [5] for more pointers to the 
literature on productivity. 

The following is a productive specification of the (infinite) stream of zeros: 

zeros : zeros 

Indeed, there exists only one maximal rewrite sequence from zeros and this ends 
in the infinite constructor normal form 0:0:0:.... Here and later we say that 
a rewrite sequence p : io ^ ti — > ^2 ^ ■ • • ends in a term s if either p is finite 
with its last term being s, or p is infinite and then s is the limit of the sequence 
of terms ti, i.e. s = limi^co ti. We consider only rewrite sequences starting from 
finite terms, thus all terms occurring in p are finite. Nevertheless, the limit s 
of the terms ti may be an infinite term. Note that, if p ends in a constructor 
normal form, then every finite prefix will be evaluated after finitely many steps. 
The following is a slightly modified specification of the stream of zeros: 

zeros : id (zeros) id(o') — > a 

This specification is considered productive as well, although there are infinite 
rewrite sequences that do not even end in a normal form, let alone in a construc- 
tor normal form: e.g. by unfolding zeros only we get the limit term : id(0 : id(0 : 
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id(. . .))). In general, normal forms can only be reached by outermost-fair rewrit- 
ing sequences. A rewrite sequence p : to ^ ^ ^2 ^ ■ ■ • is outermost-fair [15] if 
there is no t„ containing an outermost redex which remains an outermost redex 
infinitely long, and which is never contracted. For this reason it is natural to 
consider productivity of terms with respect to outermost-fair strategies. 

What about stream specifications that admit rewrite sequences to construc- 
tor normal forms, but that also have divergent rewrite sequences: 

maybe ^ : maybe maybe sink sink sink 

This example illustrates that, for non-orthogonal stream specifications, reach- 
ability of a constructor normal form depends on the evaluation strategy. The 
term maybe is only productive with respect to strategies that always apply the 
first rule. 

For this reason we propose to think of productivity as a property of indi- 
vidual terms with respect to a given rewrite strategy. This refiects the situation 
in functional programming, where expressions are evaluated according to an in- 
built strategy. These strategies are usually based on a form of outermost-needed 
rewriting with a priority order on the rules. 

3.1 Productivity with respect to Strategies 

For term rewriting systems (TRSs) [15] we now fix definitions of the notions of 
(history-free) strategy and history-aware strategy. Examples for the latter notion 
are outermost-fair strategies, which typically have to take history into account. 

Definition 3.1. Let i? be a TRS with rewrite relation -^r. 

A strategy for is a relation ^ C — >^ with the same normal forms as -^r. 

The history-aware rewrite relation —>-h.R for R is the binary relation on 
Ter(Z') X (Rx N*)* that is defined by: 

(s, kg) -^■}i,R {t, hs ■ {PtP)) *^=^ s ^ t via rule p E R at position p . 

We identify t e Ter{S) with (i, e), and for s,t G Ter{S) we write s -^n.R t 
whenever (s,e) -^h.r for some history h G {R. x N*)* . A history-aware 

strategy for R is a strategy for —>u,R- 

A strategy ^ is deterministic if s ^ < and s ^ t' implies t = t' . A strategy 
^ is computable if the function mapping a term (a term/history pair) to its set 
of ^-successors is a total recursive function, after coding into natural numbers. 

Remark 3.2. Our definition of strategy for a rewrite relation follows [16]. For 
abstract rewriting systems, in which rewrite steps are first-class citizens, a def- 
inition of strategy is given in [15, Ch. 9]. There, history-aware strategies for a 
TRS R are defined in terms of 'labellings' for the 'abstract rewriting system' 
underlying R. While that approach is conceptually advantageous, our definition 
of history-aware strategy is equally expressive. 
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Definition 3.3. A (TRS-indexed) family of strategies 5 is a function tliat as- 
signs to every TRS R a set S{R) of strategies for R. We call such a family S of 
strategies admissible if S{R) is non-empty for every orthogonal TRS R. 

Now we give the definition of productivity with respect to a strategy. 

Definition 3.4. A term t is called productive with respect to a strategy ^ if all 
maximal rewrite sequences starting from t end in a constructor normal form. 

In the case of non-deterministic strategies we require liere that all maximal 
rewrite sequences end in a constructor normal form. Another possible choice 
could be to require only the existence of one such rewrite sequence (see Sec- 
tion 3.2). However, we think that productivity should be a practical notion. 
Productivity of a term should entail that arbitrary finite parts of the constructor 
normal form can indeed be evaluated. The mere requirement that a constructor 
normal form exists leaves open the possibility that such a normal form cannot 
be approximated to every finite precision in a computable way. 

For orthogonal TRSs outermost-fair (or fair) rewrite strategies are the natural 
choice for investigating productivity because they guarantee to find (the unique) 
infinitary constructor normal form whenever it exists (see [15]). 

Pairs and finite lists of natural numbers can be encoded using the well-known 
Godel encoding. Likewise terms and finite TRSs over a countable set of variables 
can be encoded. A TRS is called finite if its signature and set of rules are finite. 
In the sequel we restrict to (families of) computable strategies, and assume that 
strategies are represented by appropriate encodings. 

Now we define the productivity problem in TRSs with respect to families of 
computable strategies, and prove a Ilj-completeness result. 

Productivity Problem with respect to a family S of computable strategies 
Instance: Encodings of a finite TRS i?, a strategy e S{R) and a term t. 
Answer: 'Yes' if t is productive with respect to 

Theorem 3.5. For every family of admissible, computable strategies S, the pro- 
ductivity problem with respect to S is Il2-complete. 

Proof. A (deterministic) Turing machine is called total (encodes a total function 
N ^ N) if it halts on all inputs encoding natural numbers. The problem of 
deciding whether a deterministic Turing machine is total is well-known to be 
Ilj-complete, see [8]. Let M be an arbitrary Turing macliine. Employing the 
encoding of deterministic Turing machines into orthogonal TRSs from [9], we 
can define a TRS Rm that simulates M such that for every n e N it holds: 
every reduct of the term M(s"(0)) contains at most one redex occurrence, and 
the term M(s"(0)) rewrites to if and only if the Turing machine M halts on the 
input n. Note that the rewrite sequence starting from M(s"(0)) is deterministic. 
We extend the TRS Rm to a TRS R^j with the following rule: 

go(0,a;) ^0:go(M(x),s(.T)) 
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and choose the term t ~ go(0, 0). Then is orthogonal and by construction 
every reduct of t contains at most one redcx occurrence (consequently all strate- 
gies for R coincide on every reduct of t). The term t is productive if and only 
if M(s"(0)) rewrites to for every n £ N which in turn holds if and only if the 
Turing machine M is total. This concludes Hj-hardness. 

For Hj-completeness let 5 be a family of computable strategies, R a TRS, £ 
iS(i?) and t a term. Then productivity of t can be characterised as: 

Vd G N. 3n e N. every n-step ^-reducts of t , , 

is a constructor normal form up to depth d 

Since the strategy ^ is computable and finitely branching, all n-step reducts of t 
can be computed. Obviously, if the formula {-k) holds, then t is productive w.r.t. 

Conversely, assume that t is productive w.r.t. For showing (★), let d G N 
be arbitrary. By productivity oit w.r.t. on every path in the reduction graph 
of t w.r.t. ^ eventually a term with a constructor normal form up to depth d 
is encountered. Since reduction graphs in TRSs always are finitely branching, 
Koenig's lemma implies that there exists an n S N such that all terms on depth 
greater or equal to n in the reduction graph of t are constructor prefixes of depth 
at least d. Since d was arbitrary, (★) has been established. Because (★) is a II^- 
formula, the productivity problem with respect to S also belongs to IIj. □ 

Corollary 3.6. In orthogonal TRSs, productivity is H^-complete with respect to 
outermost-fair rewriting. 

Proof. Apply Theorem 3.5 to the family of strategies that assigns to every or- 
thogonal TRS R the set of computable, outermost-fair rewriting strategies for 
R, and to non-orthogonal TRSs. 

The definition of productivity with respect to computable strategies reflects 
the situation in functional programming. Nevertheless, we now investigate vari- 
ants of this notion, and determine their respective computational complexity. 

3.2 Strong Productivity 

As already discussed, only outermost-fair rewrite sequences can reach a con- 
structor normal form. Dropping the fine timing device 'strategies', we obtain the 
following stricter notion of productivity. 

Definition 3.7. A term t is called strongly productive if all maximal outermost- 
fair rewrite sequences starting from t end in a constructor normal form. 

The definition requires all outermost-fair rewrite sequences to end in a construc- 
tor normal form, including non-computable rewrite sequences. This catapults 
productivity into a much higher class of undecidability: 11} , a class of the an- 
alytical hierarchy. The analytical hierarchy continues the classification of the 
arithmetical hierarchy using second order formulas. The computational complex- 
ity of strong productivity therefore exceeds the expressive power of first-order 
logic to define sets from recursive sets. 
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A well-known result of recursion theory states that for a given computable 
relation > C N x N it is n}-hard to decide whether > is well-founded, see [8] . Our 
proof is based on a construction from [3] . There a translation from deterministic 
Turing machines M to TRSs TZootM (which we explain below) together with a 
term Im is given such that: Im is root-terminating (i.e., Im admits no rewrite 
sequences containing an infinite number of root steps) if and only if the binary 
relation >m encoded by M is well-founded. The TRS TZooti\j consists of rules 
for simulating the Turing machine M such that M{x,y) — >* T iff x >m y holds 
(which basically uses a standard encoding of deterministic Turing machines, 
see [9]), a rule: 

run(T, ok(a;), ok(y)) run(A/(a;, y), ok(y), pickn) 

and rules for randomly generating a natural number: 

pickn c(pickn) pickn ok(0(>)) c(ok(a;)) — > ok(S(a;)) . 

The term Im = run(T, pickn, pickn) admits a rewrite sequence containing in- 
finitely many root steps if and only if > m is not well-founded. More precisely, 
whenever there is an infinite decreasing sequence xi >m ^2 >ai x^ >m ■ ■ then 
Im admits a rewrite sequence run(T, pickn, pickn) run(T, ok(a:i), ok(a;2)) — > 
run(M(a;i, a;2), ok(a;2), pickn) run(T, ok(a;2), ok(a;3)) .... We further note 
that tM and all of its reducts contain exactly one occurrence of the symbol run, 
namely at the root position. 

Theorem 3.8. The recognition problem for strong productivity is H\-complete. 

Proof. For the proof of IlJ-hardness, let M be a deterministic Turing machine. 
We extend the TRS TZootM from [3] with the rule run(a;, y, z) ^ : run(x, y, z). 
As a consequence the term run(T, pickn, pickn) is strongly productive if and only 
if >M is well-founded (which is n}-hard to decide). If >m is not well-founded, 
then by the result in [3] t^j admits a rewrite sequence containing infinitely many 
root steps which obviously does not end in a constructor normal form. On the 
other hand if >m is well-founded, then Im admits only finitely many root steps 
with respect to TZootM, and thus by outermost-fairness the freshly added rule 
has to be applied infinitely often. This concludes n}-hardncss. 

Rewrite sequences of length uj can be represented by functions r : N — > N 
where r{n) represents the n-th term of the sequence together with the position 
and rule applied in step n. Then for all r (one universal Vr function quantifier) 
we have to check that r converges towards a constructor normal form whenever 
r is outermost-fair; this can be checked by a first order formula. We refer to [3] 
for the details of the encoding. Hence strong productivity is in II}. □ 

3.3 Weak Productivity 

A natural counterpart to strong productivity is the notion of 'weak produc- 
tivity': the existence of a rewrite sequence to a constructor normal form. Here 
outermost-fairness does not need to be required, because rewrite sequences that 
reach normal forms are always outermost-fair. 
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Definition 3.9. A term t is called weakly productive if there exists a rewrite 
sequence starting from t that ends in a constructor normal form. 

For non-orthogonal TRSs the practical relevance of this definition is questionable 
since, in the absence of a computable strategy to reach normal forms, mere 
knowledge that a term t is productive does typically not help to find a constructor 
normal form of t. For orthogonal TRSs computable, normalising strategies exist, 
but then also all of the variants of productivity coincide (see Section 3.4). 

Theorem 3.10. The recognition problem for weak productivity is complete. 

Proof. For the proof of -hardness, let M be a Turing machine. We exchange 
the rule run(T, ok(a;), ok(?/)) run(A/(a;, y), ok(y), pickn) in the TRS TZootM 
from [3] by the rule run(T, ok(a-), ok{y)) : run(M(x, y), ok{y), pickn). Then we 
obtain that the term run(T, pickn, pickn) is weakly productive if and only if >m 
is not well-founded (which is Ej-hard to decide). This concludes E}-hardness. 

The remainder of the proof proceeds analogously to the proof of Theorem 3.8, 
except that we now have an existential function quantifier 3r to quantify over 
all rewrite sequences of length ui. Hence weak productivity is in E}. □ 

3.4 Discussion 

For orthogonal TRSs all of the variants of productivity coincide. That is, if 
we restrict the first variant to computable outermost-fair strategies; as already 
discussed, other strategies are not very reasonable. For orthogonal TRSs there 
always exist computable outermost-fair strategies, and whenever for a term there 
exists a constructor normal form, then it is unique and all outermost-fair rewrite 
sequences will end in this unique constructor normal form. 

This raises the question whether uniqueness of the constructor normal forms 
should be part of the definition of productivity. We consider a specification of 
the stream of random bits: 

random : random random 1 : random 

Every rewrite sequence starting from random ends in a normal form. However, 
these normal forms are not unique. In fact, there are uncountably many of them. 
We did not include uniqueness of normal forms into the definition of productiv- 
ity since non-uniqueness only arises in non-orthogonal TRSs when using non- 
deterministic strategies. However, one might want to require uniqueness of nor- 
mal forms even in the case of non-orthogonal TRSs. 

Theorem 3.11. The problem of determining, for TRSs R and terms t in R, 
whether t has a unique (finite or infinite) normal form is H\-complete. 

Proof. For Hj-hardness, we extend the TRS constructed in the proof of The- 
orem 3.10 by the rules: start run(T, pickn, pickn), rur\{x, y,z) run{x,y, z), 
start ones, and ones 1 : ones. Then start has a unique normal form if and 
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only if >M is well- founded. For IlJ-conipleteness, we observe that the property 
can be characterised by a n}-formula: we quantify over two infinite rewrite se- 
quences, and, in case both of them end in a normal form, we compare them. 
Note that consecutive universal quantifiers can be compressed into one. □ 

Let us consider the impact on computational complexity of taking up the 
condition of uniqueness of normal forms into the definition of productivity. In- 
cluding uniqueness of normal forms without considering the strategy would in- 
crease the complexity of productivity with respect to a family of strategies to 
However, we think that doing so would be contrary to the spirit of the no- 
tion of productivity. Uniqueness of normal forms should only be required for the 
normal forms reachable by the given (non-deterministic) strategy. But then the 
complexity of productivity remains unchanged, Ilj-complete. The complexity of 
strong productivity remains unaltered, n}-complete, when including uniqueness 
of normal forms. However, the degree of undecidability of weak productivity in- 
creases. From the proofs of Theorems 3.10 and 3.11 it follows that the property 
would then both be Sj-hard and Hj-hard, then being in Aj. 

4 Productivity for Lazy Stream Specifications is II" 

In this section we strengthen the undecidability result of Theorem 3.5 by showing 
that the productivity problem is Hj-complete already for a very simple format 
of stream specifications, namely the lazy stream format (LSF) introduced on 
page 3. Wc do so by giving a translation from Fractran programs into LSF and 
applying Theorem 2.2. 

Definition 4.1. Let P = . . . , be a Fractran program. Let d be the least 
common multiple of the denominators of P, that is, d := lcm(gi, . . . , q^). Then 
for n = 1, . . . , d define p'n = Pi' {d/ qi) and 6„ = n • ^ where ^ is the first fraction 
of P such that n ■ ^ is an integer, and wc let and &„ be undefined if no such 
fraction exists. Then, the stream specification induced by P is a term rewriting 
system TZp = {Ep,Rp) with: 

Up — {•,:, head, tail, zip^, Mp} U {modp^ | is defined} 

and with Rp consisting of the following rules: 

Mp zip^(Ti, . . . , T^), where, for 1 < jt. < d, T„ is shorthand for: 

^ _ ('modp;(tail^"~\Mp)) if pj, is defined, 

: modd(tail""^(Mp)) if p'„ is undefined. 

head(a; : a) ^ x modfc(cr) head{a) : modfc(tail'^((T)) 

tail(a- : cr) ^ (T z\pa{ai,a2 ...,(7d)^ head((Ti) : zip^(cr2, . . . , fid, tail(cri)) 

where x, cr, ai are variables.^ 

^ Note that mod£j(tair~^(zip^(Ti, . . . , T^))) equals Tn, and so, in case p'^ is undefined, 
we just have T„ = • : T„. In order to have the simplest TRS possible (for the purpose 
at hand), we did not want to use an extra symbol (•) and rule (•)—>•: (•). 
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The rule for mod„ defines a stream function wliicli takes from a given stream 
cr all elements cr(i) with i = (mod n), and results in a stream consisting of 
those elements in the original order. As we only need rules modpj^ whenever p'^ 
is defined we need d such rules at most. 

If p'^ is undefined then it should be understood that m • is undefined. For 
n G N let ip{n) denote the number from {1, . . . , d} with n = ip{n) (mod d). 

Lemma 4.2. For every n > we have fp{n) = \_{n — • P^(„) + btp(n)- 

Proof. Let n > 0. For every i G {1, . . . ,k} we have n ■ ^ G N if and only if 
f{n) • — S N, since n = ip{n) mod d and d is a multiple of qi. Assume that fp{n) 
is defined. Then/p(n) = n-p'^^^^/d = {[{n-l) I d\-d+{{n-l) mod d) + l)-p;(„)/d 
= L("--l)/'^J--P'^(„)+<^("-)--P«/'?i = L('^-l)/c^J--P^(„)+^v(«)- Otherwise whenever 
fp{n) is undefined then is undefined. □ 

Lemma 4.3. Let P be a Fractran program. Then TZp is productive for Mp if 
and only if P is terminating on all integers n> 0. 

Proof. Let cr(n) be shorthand for head(tair(tT)). It suffices to show for all n E N: 
Mp{n) — >* • if and only if P halts on n. For this purpose we show Mp{n) ^+ • 
whenever fp{n + 1) is undefined, and Mp(7i) ^+ Mp{fp{n + 1) — 1), otherwise. 
We have Mp(n) T^(„+i)([n/dJ). 

Assume that fp{n + 1) is undefined. By Lemma 4.2 p^^^^j^-j is undefined, 
thus thus Mp(n) ^* • whenever [n/d\ = 0, and otherwise we have: 

Mp{n) ^* T^(„+i)(KdJ) ^* modd(tail'^("+i)-i(Mp))(L7VrfJ - 1) ^* Mp(n') 

where n' ~ {\n/d\ — 1) ■ d + ip{n + 1) — 1 = n — d. Clearly n = n' (mod d), and 
then Mp{n) • follows by induction on n. 

Assume that fp{n + 1) is defined. By Lemma 4.2 p'^(^„^i-j is defined and: 

Mp(n) ^* T^(„+i)(Ln/dJ) ^* mod^,^^^^^^ (tail^-'"+^>-i(Mp))( [n/dj ) 

and hence Mp(n) ^+ Mp{n') with n' = [n/dj • + &ip(ri+i) ^ 1- Then we 

have n' = fp{n + 1) — 1 by Lemma 4.2. □ 

Theorem 4.4. The restriction of the productivity problem to stream specifica- 
tions induced by Fractran programs and outermost-fair strategies is Yi^-complete. 

Proof. Since by Lemma 4.3 the uniform halting problem for Fractran programs 
can be reduced to the problem here, Ilj-hardness is a consequence of Theorem 2.2. 
Ilj-completeness follows from membership of the problem in n§, which can be 
established analogously as in the proof of Theorem 3.5. □ 

Note that Theorem 4.4 also gives rise to an alternative proof for the Il^-hardness 
part of Theorem 3.5, the result concerning the computational complexity of 
productivity with respect to strategies. 
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